How much measurement independence is needed in order to demonstrate nonlocality? 
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If nonlocality is to be inferred from a violation of Bell's inequality, an important assumption is that 
the measurement settings are freely chosen by the observers, or alternatively, that they are random 
and uncorrelated with the hypothetical local variables. We study the case where this assumption 
is weakened, so that measurement settings and local variables are at least partially correlated. As 
we show, there is a connection between this type of model and models which reproduce nonlocal 
correlations by allowing classical communication between the distant parties, and a connection 
with models that exploit the detection loophole. We show that even if Bob's choices are completely 
independent, all correlations obtained from projective measurements on a singlet can be reproduced, 
with the correlation (measured by mutual information) between Alice's choice and local variables 
less than or equal to a single bit. 

PACS numbers: 



Quantum nonlocality, whereby particles appear to in- 
fluence one another instantaneously even though they are 
widely separated in space, is one of the most remark- 
able phenomena known to modern science pM]. Histor- 
ically, this peculiar prediction of quantum theory trig- 
gered many debates and even doubts about its validity. 
Today, it is a well established experimental fact [5] . 

The profound implications of quantum nonlocality for 
our world view remain controversial. But it is no longer 
considered as suspicious, or of marginal interest. It is 
central to our understanding of quantum physics, and 
in particular it is essential for the powerful applications 
of quantum information technologies. In 1991, A. Ek- 
ert showed how shared entanglement could enable dis- 
tant partners to establish a shared cryptographic key [8] . 
Ekert's intuition is quite simple - if there are no local 
variables, then no adversary could possibly hold a copy 
of these variables - yet it came decades after the birth 
of quantum mechanics. In 2005, Barrett and co-workers 
used this intuition to show how, with no further assump- 
tions about quantum theory, nonlocal correlations alone 
could ensure security of a secret key [S]. Acm and co- 
workers extended this result, showing how to generate 
secret keys using simple nonlocal correlations that violate 
the well-known CHSH inequality [lOj . Simultaneously, it 
was realized that Bell inequalities are the only entangle- 
ment witnesses that can be trusted in cases where the 
dimensions of the relevant Hilbert spaces are unknown 

In an experimental demonstration of quantum nonlo- 
cality, measurements are performed on separated quan- 
tum systems in an entangled state, and it is shown that 
the measurement outcomes are correlated in a manner 
that cannot be accounted for by local variables. In order 
to conclude that nonlocality is exhibited, it is crucial for 
the analysis that the choices of which measurement to 
perform are freely made by the experimenters. Alterna- 
tively, they must be random and uncorrelated with the 
hypothetical local variables. It is well known that if the 



measurement settings are not random, but are in fact 
determined by the local variables, then arbitrary corre- 
lations can be reproduced [TT] . 

Here we reverse the argument. Taking for granted that 
quantum nonlocal correlations are produced, how inde- 
pendent must the measurement settings be assumed to 
be in order to rule out an explanation in terms of lo- 
cal variables [5]? We show that all correlations obtained 
from projective measurements on a singlet state can be 
reproduced with the mutual information between local 
variables and the measurement setting on one side less 
than or equal to one bit. This is not only of fundamen- 
tal interest. If a cryptographic protocol is relying on the 
nonlocality of correlations for security, it is vital that 
there is no underlying local mechanism that an eaves- 
dropper may be exploiting. The result shows that ran- 
dom number generators used to determine settings must 
be assumed to have a very high degree of independence 
from the particle source. 

Bell Experiments. In an experimental test of a Bell 
inequality, two experimenters - traditionally named Al- 
ice and Bob - are spatially separated. They each control 
a quantum system, and the joint state of these two sys- 
tems is an entangled state. Alice and Bob each perform 
a measurement on their quantum system, in such a way 
that the measurements are spacelike separated. Alice's 
measurement is chosen from a finite set of possibilities, as 
is Bob's. Call these measurement settings the inputs and 
label them x for Alice's choice and y for Bob's choice. 
The outcome of Alice's measurement (her output) is la- 
beled a and Bob's b. By repeating this procedure many 
times and collating the data, Alice and Bob can estimate 
conditional probabilities of the form P(a, b\x, y). 

The question is then, when can the correlations pro- 
duced in an experiment like this be explained by under- 
lying local variables? Locality here implies that the out- 
come of Alice's measurement cannot be directly influ- 
enced by Bob's choice of which measurement to perform, 
and vice versa. More precisely, let the hypothetical un- 
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FIG. 1: Schematic illustration of an experiment for testing 
quantum nonlocality. 



derlying variables be denoted A, and assume a distribu- 
tion -P(A). Then Bell locality is the condition that 

P(a,b\x,y,X) = P(a\x,X) ■ P(b\y,X). (1) 

If the correlations could in principle be explained as aris- 
ing from underlying local variables, then they can be 
written in the form 

P(a, b\x, y) = Yl P W ■ A ) ' A )- ( 2 ) 

A 

(Here and throughout, sums should be replaced with inte- 
grals when the variable is not discrete.) The correlations 
cannot be written in this form if and only if they violate 
a Bell inequality. 

Suppose that correlations are obtained which do vio- 
late a Bell inequality. What assumptions are necessary 
to conclude that this is indicative of nonlocality? There 
is a large literature on this topic, and a reader could do 
a lot worse than refer to Bell's original papers |T|. But it 
is uncontroversial that the standard argument needs to 
assume that the inputs x,y are freely, or at least inde- 
pendently and randomly chosen by Alice and Bob. 

Non-independent measurement choices. We 
wish to analyze the case in which measurement settings 
can be correlated with local variables. Hence in addition 
to a set of correlations P(a,b\x,y), assume a distribu- 
tion over inputs P(x,y). A correlated- settings model is 
defined by a variable A, and an overall joint probabil- 
ity distribution Pcs(a, b, x, y, A). The correlated-settings 
model reproduces the given correlations and input dis- 
tribution if Pcs(o,, b\x, y) = P(a, b\x,y) and Pcs(x,y) = 
P(x,y). The model is Bell-local if the distribution Pes 
also satisfies an equation of the form of Eq ([!]). In this 
case 

Pcs{a,b\x,y) = ^P cs (A|a;,y) • P cs (a,b\x,y, X) 

A 

- J2Pcs(X\x,y) ■ Pcs(a\x,\) ■ P GS (b\y,X). (3) 

A 



Note the similarity with Eq. The usual kind of 

model, in which Alice's and Bob's inputs are assumed 
independent of the local variables, is a special case with 
Pcs{x,y\X) = Pcs(x,y)- By Bayes' rule, this also gives 
Pcs(X\x,y) = Pcs(A), and Eq. Q reduces to an equa- 
tion of the form of Eq. 

Another special case is the extreme case where the in- 
puts are completely determined by A, so that x = /(A) 
and y = g(X) for functions / and g. Here, for all x, y, A, 
Pcs(x, y\X) — or 1. Any correlations can be produced 
by a Bell-local model of this form [TT] . 

In other models, x and y will be neither independent 
nor determined by A. In general there are many numeri- 
cal measures of the degree to which they are correlated. 
One particularly natural and simple measure is the mu- 
tual information. Thus a correlated-settings model can 
be characterized by the mutual information between the 
measurement choices x, y and A: 

I(x, y : A) = H(x, y) + H(X) - H(x, y, A), (4) 

where H is the Shannon entropy. When x and y are 
independent of A, I(x, y : A) = 0. If x and y are functions 
of A on the other hand, then I(x, y : X) = H{x, y). Since 
in general I(x,y : A) < H(x,y), this means that the 
mutual information is maximal (with respect to a fixed 
distribution over inputs P(x,y)). 

In the context of correlated-settings models, the ques- 
tion is no longer whether quantum correlations violate 
a Bell inequality. The question is, how large must 
I(x,y : A) be if the correlations are to be reproduced 
within a Bell-local model? Alternatively, for a fixed de- 
gree of correlation I(x, y : A), do quantum correlations vi- 
olate a Bell inequality by a sufficiently large margin that 
the model cannot possibly be Bell-local? This depends 
on the precise experiment. For example, if the input al- 
phabet consists of only two choices, as with the CHSH 
inequality, then any correlations can be reproduced by a 
correlated-settings model with I(x,y : X) < 1. But the 
question remains open for larger input alphabets. Intu- 
itively one may think that for large alphabets, if Alice's 
choice is only mildly correlated with the variable A, then 
a sufficient violation of a suitable Bell inequality should 
still rule out locality. But we shall see that this intuition 
is wrong, at least for maximally entangled qubits. 

A connection to communication cost. One way 
to simulate quantum correlations, including nonlocal cor- 
relations, is for Alice and Bob to communicate with one 
another after inputs are chosen, but before outputs are 
produced. Any correlations can be produced this way if 
Alice and Bob share random data, and Alice simply in- 
forms Bob of her input choice. Hence, if Alice is choosing 
from ua possible inputs, then an obvious upper bound 
on how many classical bits need to be communicated is 
given by log ha- The minimum number of bits that must 
be communicated, on the other hand, provides a natural 
way of quantifying the amount of nonlocality inherent in 
a given set of correlations. This interesting line of re- 
search was started by T. Maudlin [2] and independently 
by Tapp et al. and by Steiner [T2J[T2]. It culminated with 
a model of Toner and Bacon, who proved that one single 



bit of communication from Alice to Bob is sufficient to 
simulate all correlations obtained from arbitrary projec- 
tive (i.e. Von Neumann) measurements on two qubits in 
a maximally entangled state jH] . ft was extended to all 
two-party correlations, though ignoring the marginals, in 

Em. 

A slightly more precise description of a communication 
model for simulating quantum correlations is as follows. 
The random data shared between Alice and Bob is a 
variable /i, with distribution Pc(/x). Assuming that Alice 
communicates first, Alice sends a bit string c\ to Bob. 
The string C\ can depend on fj, and on Alice's input x. It 
may also depend on further random data held by Alice, 
but this data can without loss of generality be absorbed 
into y,. Hence assume that c\ = c±(fi,x). Depending 
on the protocol Bob may now send a bit string c 2 to 
Alice, where again without loss of generality, we assume 
that C2 is a function of fj,, c\ and y. This continues for 
a total of k messages. The entire conversation between 
Alice and Bob on a particular round is the sequence m = 
ci, C2, . . . , Cfc, and is a function to — f(x i y,fx) of x, y, 
and /j,. Finally, Alice outputs a, where again absorbing 
all randomness into fx, a can be assumed to be a function 
a = g(x, /i, to) of her input, the shared random data and 
the conversation. Similarly, Bob outputs b, which is a 
function b — h(y, /it, m). The protocol then terminates. 

For each pair of inputs, a communication model de- 
fines a distribution Pc(a, b, to, [i\x, y), and reproduces 
correlations P(a,b\x,y) if Pc(a,b\x,y) = P{a,b\x,y). 
If a distribution over inputs P(x, y) is also given, the 
conversation m has a well-defined distribution P(m) — 
y Pc(f^\x, y) ■ P(x, y). Let the entropy of this distri- 
bution be H(m). 

The connection with correlated-settings models is ex- 
pressed in the following 

Theorem 1 Consider a fixed input distribution P(x,y), 
and a communication model with total conversation m. 
Then there exists a Bell-local correlated-settings model, 
which reproduces the same correlations and P(x,y), with 
I(x,y : A) < H(m). 

Proof. Given P(x,y) and a communication model as 
above, construct a correlated-settings model as follows. 
Define A as the pair A = (//, m). In the communication 
model, m is the conversation between Alice and Bob, 
but in the correlated-settings model there is no commu- 
nication, and A represents the underlying variable. De- 
fine the correlated-settings model by Pcs(a,b, x 7 y, \) = 
P(x, y)Pc{a 1 b, /i, m\x, y). It is easy to see that the result- 
ing model reproduces P(x,y) and the same correlations 
as the communication model. Bell locality is satisfied 
since 

Pcs(a, b\x,y, A) = Pc(a, b\x, y, /i, to) 

= P c (a\x,n,m) ■ P c (b\y,[i,m) 
= P cs (a\x,X)-P cs (b\y,\). 

The mutual information between A and {x, y) is 

I(X:x,y) = : x,y) + I(m : x,y\n) (5) 



= + H{m\n) - H(m\fi,x,y) (6) 
= H{m\fj) (7) 
< H(m). (8) 

The first line uses the chain rule [IB], the second line 
follows from the fact that [i is assumed to be independent 
of the inputs x,y, and the third line holds because to is 
a function of fi and x, y. This concludes the proof. 

Intuitively, A = (/i, to) restricts Alice's and Bob's 
choices to inputs (x,y) such that to = f(x,y,/j,). We 
stress the generality of this result: the communication 
model may have involved two way communication, and 
may have been such that the communication on a given 
run is unbounded. As long as H(m) is finite, the bound 
is useful. 

We now apply the theorem with reference to the Toner- 
Bacon communication model. Here, x and y are arbitrary 
projective measurements. The shared random data fi 
takes the form of two vectors on the Bloch sphere and 
the distribution P(fi) is such that the two vectors are 
independent, with each uniformly distributed. For our 
purposes most of the details of the model do not matter, 
and we refer the reader to |14] for a description and proof 
that it reproduces quantum correlations for projective 
measurements on a singlet. It suffices to know that each 
round, the conversation to is a single bit communicated 
from Alice to Bob. The bit to = f(x, /i) is a function of 
Alice's input x and /i. 

The resulting correlated-settings model also repro- 
duces all correlations from projective measurements on a 
singlet. In general, the actual value of I(x, y : A) depends 
on the distribution over inputs P(x, y). The theorem tells 
us that for any such distribution, I{x, y : A) < H(m) < 1, 
since to is a single bit. In particular, if Alice's and Bob's 
inputs are independent, with each chosen from a uniform 
distribution over all possible directions, it is easy to verify 
that I(x,y : A) w 0.85. 

In a typical Bell-type experiment with a singlet, Al- 
ice and Bob will be choosing from finite sets of measure- 
ments. The Toner-Bacon model can still be applied, with 
the distribution P(x,y) having support only on these fi- 
nite sets. In this case too, I(x,y : A) < 1. The Bell-local 
correlated-settings model reproduces the quantum corre- 
lations no matter how large the input alphabets, and no 
matter what Bell inequality is being tested. 

Finally, in the Toner-Bacon model, Bob's setting is of 
course independent from the communication he receives 
from Alice. It follows that in the derived correlated- 
settings model, Bob's setting is independent from A, i.e., 
I(y : A) = 0. 

The detection loophole. In a real Bell experiment, 
detection is inefficient. A typical analysis of the exper- 
iment estimates the correlations P(a, b\x, y) from those 
runs on which both Alice's and Bob's detectors clicked, 
and simply ignores all the other runs. Such an analysis 
is valid, as long as it is assumed that the probability of 
a detector clicking is independent of the hypothetical lo- 
cal variables. This is sometimes called the fair sampling 
assumption. 

It is possible to reproduce nonlocal correlations with 



a model that is Bell-local, but which violates the 
fair sampling assumption [17l [18]. Denote the event 
that Alice's (Bob's) detector clicks by Da (D b ). A 
detection- efficiency model is defined by a variable A, 
with distribution Pde(X), independent of x,y, and for 
each x, y a distribution PrjE(a,b, DA,D B \x,y, A). Bell- 
locality is the condition that Pee(o-, b, Da, D b \x, y, A) = 
Pde{o,, Da\x, X)-PDE{b, D B \y, A). The model reproduces 
correlations P(a,b\x,y) if 

P(a, b\x, y) = P DE {a, b\x, y, D A ,D B ). (9) 

The efficiencies of Alice's detectors in this model are 
given by P D e{D a \x) = Y,\ p de{X)Pde{D a \x, A), and 
similarly for Bob's. If the detection efficiencies in a real 
experiment are high enough, and if a Bell inequality 
is violated by a large enough margin, then a Bell-local 
detection-efficiency model reproducing the correlations 
can be ruled out. 

Given a distribution P(x, y), and a detection-efficiency 
model reproducing correlations P(a,b\x,y), it is easy to 
construct a correlated-settings model which also repro- 
duces P(x,y) and P(a, b\x, y). Simply define 

Pcs(a, b, x, y, A) = P(x. y) ■ P D e(o,, b, X\x, y, D A , D B ). 

(10) 

It is easy to show, using Eq. that the correlated- 
settings model reproduces P(x,y) and P(a,b\x,y). 

The construction can be applied, for example, to the 
Gisin-&-Gisin detection-efficiency model [19], which re- 
produces correlations from arbitrary projective measure- 
ments on a singlet. The efficiencies are independent of 
x and y and are given by P(Da) = 1/2, P(D B ) = 1. 
In this model A is a vector on the Bloch sphere, and 
Pde(M x >ViDa,Db) = \X.x\/2n where x denotes the 
Bloch vector representing the measurement x. In the 
resulting correlated-settings model, I(x,y : A) depends 
on P(x,y). If x and y are independently and uniformly 
distributed, it is easy to show that I(x, y : X) — I(x : 
A) ~ 0.28. This is an even lower value than was obtained 
above using the Toner-Bacon model. 

Finally, note that in the case of communication mod- 
els, there was a connection between I(x, y : A) and the 



amount of communication (as measured by the entropy 
of the conversation's distribution). With detection ef- 
ficiency models, we expect there to be a connection be- 
tween I(x, y : A) and the detection efficiencies. Exploring 
this connection is an interesting direction for future work. 

Conclusion. It is well known that in order to derive 
nonlocality from violation of a Bell inequality, one has 
to assume that there is no correlation between the hypo- 
thetical local variables A and the experimenters' measure- 
ment choices x and y. We have investigated how much 
correlation could be allowed if quantum predictions are 
to remain incompatible with local variables. Surprisingly, 
no matter how large the alphabet size of the inputs, cor- 
relations from projective measurements on a singlet can 
be reproduced, with the mutual information between Al- 
ice's input and local variables not more than one bit, and 
Bob's input completely independent. This deserves fur- 
ther investigation. Future work should analyze more gen- 
eral scenarios, involving generalized measurements, par- 
tial entanglement [20] , higher dimensional Hilbert spaces 
and more than two parties. It would also be instructive 
to consider measures of correlation other than the mutual 
information. 

Finally, let us emphasize the change in paradigm since 
the old EPR paper [1]. If, contrary to EPR, one accepts 
nonlocality as a fact, then not only can one develop pow- 
erful applications in quantum information science, like 
device- independent quantum key distribution [5] 110] , but 
moreover one can upper bound the lack of free choice of 
the players ! 
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